Prediction of Protein Dispensability through Integrated Analysis of Multiple-Source High-Throughput Data
Abstract
Protein dispensability is fundamental to the understanding of gene function and evolution. It is usually studied at the level of individual gene phenotypes. Recent advances in generating high-throughput data, such as genomic sequence data, protein-protein interaction data, gene-expression data, and growth-rate data of mutants, allow us to investigate protein dispensability systematically at the genome scale. In our studies, protein dispensability was represented as a fitness score measured by the growth rate of gene-deletion mutants. Through analyses of high-throughput data in the yeast Saccharomyces cerevisiae, we found that a protein’s dispensability had significant correlations with its evolutionary rate and duplication rate, as well as with its connectivity in the protein-protein interaction network and the gene-expression correlation network. Correspondence analysis also showed such significant dependencies, which implies that the integration of high-throughput data can provide substantial information on protein dispensability. Neural networks and support vector machines were therefore applied to predict protein dispensability. Our study provides a “proof-of-principle” for a global understanding of protein dispensability through computational analyses of high-throughput data.
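The correlation step described in the abstract can be illustrated with a small sketch. The abstract does not say which correlation measure was used, so the Spearman rank correlation here is an assumption, and the `fitness` and `connectivity` values are invented toy numbers, not data from the study; they imply nothing about the direction or strength of the reported correlations.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical toy data: one fitness score and one interaction count per gene.
fitness = [0.20, 0.45, 0.60, 0.85, 0.95, 1.00]
connectivity = [40, 22, 15, 9, 4, 2]

rho = spearman(fitness, connectivity)
print(f"Spearman rho between fitness and connectivity: {rho:.3f}")
```

A rank-based measure is used in this sketch because it does not assume a linear relationship between the fitness score and a genomic feature; the same function could be applied to any of the other features named in the abstract.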
Related resources
Understanding protein dispensability through machine-learning analysis of high-throughput data
MOTIVATION: Protein dispensability is fundamental to the understanding of gene function and evolution. Recent advances in generating high-throughput data such as genomic sequence data, protein-protein interaction data, gene-expression data and growth-rate data of mutants allow us to investigate protein dispensability systematically at the genome scale. RESULTS: In our studies, protein dispensab...
Performance Improvement of Expanded Integrated Local Area Networks (RESEARCH NOTE)
In Local Area Networks (LANs) connected by bridges, flow control and smooth traffic are very important. However, congestion at bridges can cause heavy loss of received frames. In addition, the discarded frames have to be retransmitted by the source station, which causes more congestion and a massive reduction in overall network throughput. The netw...
Throughput Maximization for Multi-Slot Data Transmission via Two-Hop DF SWIPT-Based UAV System
In this paper, an unmanned aerial vehicle (UAV) assisted cooperative communication system is studied, wherein a source transmits information to the destination through an energy-harvesting decode-and-forward UAV. It is assumed that the UAV can move freely between the source-destination pair to set up line-of-sight communications with both nodes. Since the battery of the UAV may be limite...
Functional genomic analysis of the rates of protein evolution.
The evolutionary rates of proteins vary over several orders of magnitude. Recent work suggests that analysis of large data sets of evolutionary rates in conjunction with the results from high-throughput functional genomic experiments can identify the factors that cause proteins to evolve at such dramatically different rates. To this end, we estimated the evolutionary rates of >3,000 proteins in...
Putting microarrays in a context: Integrated analysis of diverse biological data
In recent years, multiple types of high-throughput functional genomic data that facilitate rapid functional annotation of sequenced genomes have become available. Gene expression microarrays are the most commonly available source of such data. However, genomic data often sacrifice specificity for scale, yielding very large quantities of relatively lower-quality data than traditional experimenta...
Publication date: 2004